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Introduction 

In the vicinity of the zero fixed point0 of the differential equation 

x = Ax + o{x) (xeW 1 ), (1) 

the behaviour of other solutions is mostly determined^ by the location of the eigenvalues 
of the matrix A with respect to the imaginary axis. For instance, if the spectrum of the 
matrix A lies in the open left half-plane of the complex plane, then the fixed point is 
asymptotically stable. It is unstable if the matrix A has at least one eigenvalue with 
positive real part. If we know that the matrix A has no purely imaginary eigenvalues, 
then the local structure of the phase portrait of the system (pp) can be determined by the 
number of eigenvalues of the matrix A in the left and right half-planes of the complex 
plane. For all these reasons, matrix theory methods and techniques for answering such 
questions are of great interest in stability theory. Since the spectrum of the matrix A is 
the set of all roots of its characteristic polynomial p(X) = detfAJ — A), then the same 
questions are to be answered in the theory of polynomials, toojj 

A polynomial p is called stable if all its roots lie in the open left half-plane. The Routh- 
Hurwitz problem consists in finding conditions of polynomial stability and, generally, in 
the study of those properties of the polynomial that are in some way connected with 
location of its roots with respect to the imaginary axis. 

The Hurwitz criterion is traditionally viewed as the main result on stable polyno- 
mials. It will be discussed in §[6l The practical use of this theorem is usually limited, 
in the context of direct computations, to polynomials of low degrees (3rd, 4th, or 5th). 
In fact, the Hurwitz criterion is only one of the facts of a compact algebraic theory - 
a theory that contains other practically useful results, that is related to important and 
interesting chapters of algebra and analysis, and, finally, that is beautiful. The last point 
is important. It is most difficult to master the art of producing mathematical results, of 
posing and solving problems, but it is an art worth learning^. Unexpected ideas, subtle 
arguments are seldom fruits of pure imagination; more often, they are results of observa- 
tion, perseverance, and good taste of their author. That kind of experience comes with 
learning things that are worth emulating. The theory of stable polynomials provides a 
great sample of this kind. Within this theory, everyday mathematical notions and ideas 
interact, reshape themselves, and bring about new realms of possible applications. Taking 
these didactic ideas to heart, the author did not intend to simply give a standard list 
of facts, but instead to show the development of this mathematical theory, so that the 



1 Lit.: point of equilibrium [translators' remark]. 

2 In the so-called critical case, the nonlinear term o(x) «gets voting rights» and influences the be- 
haviour of solutions in an arbitrarily small vicinity of the fixed point. 

3 The transition from a matrix to its characteristic polynomial is far from being harmless. Firstly, 
matrix properties that may have an influence on its spectrum may be lost or hidden as a result. For 
example, it is easy to establish that all eigenvalues of a symmetric matrix must be real, but it is more 
difficult to understand how this property affects the coefficients of its characteristic polynomial. Secondly, 
the characteristic polynomial is useful only for general theoretical questions and does not easily submit 
to numerical computations and analytical derivations. 

4 Both terms became customary but they are rather unsatisfactory. The former is not good because 
there is a «stable» fixed point of a system of differential equations not of a polynomial. The latter is 
also bad, since the first person who posed the «Routh-Hurwitz problem» and who obtained fundamental 
results in this area was in fact C. Hermite. Here is the chronology of works: C. Hermite — 1856, E. J. Routh 
— 1877, A. Hurwitz — 1895. 

5 The reader is referred to textbooks [23j-[24j and problem book |25L I26j . 
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reader may became a participant in its re-creation. Acknowledging that the interest of 
some student readers may be quite pragmatic, the author at the same time tried to sepa- 
rate the basic material, which one ought to learn in any case and which is presented very 
tersely, from discussions and additional points made in remarks, problems, footnotes etc. 
Incidentally, one can learn to apply the Hurwitz theorem by solving the following fairly 
typical problem. 

Problem. Find all fixed points of the Lorentz syste 

X = aY-aX, Y = rX -Y -XZ, Z = XY-bZ. (2) 

(a, r, b are positive parameters). Get to know the statement of Hurwitz' theorem in §{B 
and apply it to investigate the stability of the fixed points found. 

A solution to this problem is given in the Appendix. It is however recommended that 
the reader obtains this solution on her/his own or at least tries to do so. 

Acknowledgment 

The author thanks Mikhail Tyaglov, who pointed out a number of typos in the first version 
of these notes; those typos are corrected in the present version. The author is also grateful 
to Olga Holtz for her active interest in these notes and assistance with their publication. 

1. Stodola condition 

One of the most basic but rather useful facts on stable polynomials is contained in the 
following theorem, which is usually attributed to the Slovak engineer A. Stodola (1893). 

Theorem 1 (Stodola). // a polynomial with real coefficients is stable, then all its coeffi- 
cients are of the same sign. 

Proof. The roots of a real polynomial are symmetric with respect to the real axis. 

Let 

p(z) = a Y\ (z - Xj) ■ JJ (z - a k - if3 k ) (z - a k + if3 k ) , 

j k 

where Xj are the real and a k ± i(3 k are the nonreal roots of the polynomial p (note that 
Xj,a k < 0). Since the binomials z — Xj and the trinomials z 2 — 2a k z + {a 2 . + f3%) have 
positive coefficients, their product has the same property. □ 

The role of Theorem Q] is quite clear: it provides a very easily verified necessary con- 
dition of polynomial stability. It cannot be reversed, except for very low degrees: 

Problem 2. A quadratic polynomial with positive coefficients is stable. 

In general, the following partial converse holds: 

Problem 3. A polynomial of degree n with positive coefficients has no roots in the sector 

i 7T 

argz < — . (3) 
n 

Hint: Consider a broken line with n+1 segments whose kth segment is parallel to the 
vector a k z n ~ k (k — 0, . . . , n). If argz is too small, this broken line cannot be closed. 



This system is related to one of the classical hydrodynamical problems, viz., the onset problem for 
convectional motion in a fluid horizontal layer heated from below. The Lorentz system is interesting due 
to the fact that its trajectories have very complex behaviour for certain values of parameters. 



8 



Problem 4. A polynomial p(z) = a z n + a\z n ~ x + • — V a n -\z — a n (ao, . . . , a n > 0) has 
exactly one root on the positive half-axis; it is smaller than the absolute value of any other 
root of p. 

The question how the signs of the coefficients affect the root distribution 

OF A POLYNOMIAL IS QUITE INTERESTING per Se (SEE [HI part V, Chapter I]), BUT LEADS 
AWAY FROM THE TOPIC OF STABLE POLYNOMIALS. ON THE OTHER HAND, THE CONSTRUC- 
TIONS OF OUR NEXT SECTION TURN OUT TO BE VERY FRUITFUL. 

2. Nyquist-Mikhailov hodograph 

Let p(z) be a polynomial of degree n. In the complex plane C, consider the curvdl 

T p = { C n p(iu) : w6l}. (4) 

As the parameter u runs from — oo to oo, the curve is traversed in a certain direction. This 
oriented curve is called the Nyquist-MikhailoA0 hodograph^, or simply the hodograph, or 
the amplitude-phase characteristic of the polynomial p. 

Assume that the polynomial p has no roots on the imaginary axis. In this case, T p 
does not go through zero and the function 

ifpiuj) = Argi~ n p(iuj) = ImLog i~ n p(iu) (uj E M.) (5) 

is continuous at each point of the real axis. Note that this function is defined up to an 
additive constant of the form Ink (k E Z) , and its values do not have to lie in the interval 
[0,27r]. In the sequel, we will be interested in the incremen E 



A p = <^p|_ oo , (6) 
which is defined unambiguously. 

Lemma 5. If p(z) = z — X (Re A ^ 0), then A p = —it sign Re A. 

Proof. The hodograph T p is a horizontal line traversed from left to right that in- 
tersects the imaginary axis at the point iReX. Obviously, as u runs from — oo to +oo, 
the radius-vector of a point on the hodograph makes a clockwise turn of magnitude 7r if 
Re A > (counter-clockwise if Re A < 0) . □ 

Theorem 6 (Hermite). If a polynomial p has n_ roots in the left half-plane and n + roots 
in the right half-plane but no roots on the imaginary axis, then 

A p = n(n_-n + ). (7) 

Proof. If p — aopi ■ ■ -p n , where Pk(z) = z — A&, then 

ip p = arg a + (p pi H h (f Pn and A p = A P1 H h A Pn . 



For now, we do not need to assume that the coefficients of the polynomial are real, although this is 
indeed the case in most applications. 

8 The normalizing factor i~ n is not of vital importance. It is needed to simplify some formula? of the 
next sections. 

9 Nyquist's name was added in translation [translators' remark]. 

10 The work of A. V. Mikhailov (1937) as well as the earlier work of the American engineer H. 
Nyquist (1932) attracted attention to the geometrical method that we describe here. Especially important 
applications of this method were found in automatic control theory, mostly thanks to papers of the 
Romanian mathematician V. M. Popov. However, «Mikhailov's hodograph» was discovered by C. Hermite. 

11 Along the way, we will also prove that the limits tp p (±oo) exist. 
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From Lemma [5l it follows that 

A p = —n (sign Re Ai + • • • + sign Re A n ) = 7r(n_ — n + ). 

□ 

Remark 7. Since there are no roots on the imaginary axis, we have n_ + n + = cleg p. 
Together with j7j), this enables us to find both numbers n_ and n + . 

Remark 8. The increment A p achieves its maximal value, which equals 7r degp, for stable 
polynomials. 

Problem 9. If a polynomial p is stable, prove that <p p is monotone increasing on E. 

Problem 10. let 7 be a closed oriented curve on the Riemann sphere (e.g., the 
imaginary axis is such a curve). let us introduce the «generalized hodograph» = 
{ i~ n p(uj) : w G 7} and define the quantities (pj and AJ analogously to ([5l) and j6j). Con- 
sider the unit circle and the sector (J3j) and formulate for them the analogue of Theorem® 

Problem 11. Theorem® is related to the « argument principle}} in the theory of analytic 

functions. What is the value of the integral - — ; J —-— dz, taken over the imaginary 

^ m -ioo V\ z ) 

axis in the sense of the Cauchy principal value? 

Problem 12. Investigate the hodograph of a rational function. Which rational functions 
do you think should be called «stable» ? 

Problem 13. On the front page you see a stylized picture of the hodograph of the poly- 
nomial p(z) = 32z 6 + 12z 5 + 46z 4 + 21z 3 + 16z 2 + 7z + 1 (the thickness of the curve 
decreases as the parameter uj increases). Where are the roots of p(z) located with respect 
to the imaginary axis? Using Maple, draw this curve on your own. Try to change one of 
the coefficients of p(z). What happens with the hodograph? 

Theorem [6] obtained by such simple tools can already be applied to count 
the number of roots of the polynomial p to the left and to the right of the 
imaginary axis. we can entrust the drawing of hodographs to a computer and 
determine the number of half-turns visually. 

it is worth considering the following questions: 
Why not use a computer to count all roots of a polynomial as well? Which 
of the two problems will require more calculations? for which polynomials 
will the pertinent calculations be hard and their results unreliable? which 
problems may arise for a developer and for a user of such a program? 

Most likely, we will come to the conclusion that it is premature to write a 
program, and the «specific» question of counting the number a p of half-turns 
requires a mathematical rather than a programming solution. this is indeed the 

CASE. 



3. Cauchy indices 

The quantity A p characterizes quite general topological properties of the curve T p . It 
turns out that for counting A p it is enough to know in which order a point moving along 
T p crosses the coordinate half-axes. 

Let 

p(z) = a z n + a x z n ~ x H h a n (a G M, a > 0). (8) 
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Let us consider the real polynomials 



f (cu) = +Re \i- n p(iu])) = a u n + ■■■ , 
fx(u) = -lm [C n p(iu)] = (ReaOcj"" 1 + ••• 



(9) 



satisfying 



i n p(iu) = f (u) - ifi(u), deg/i < deg/ - 
If all coefficients of the polynomial jHJ) are real, then we have: 



(10) 



fo(cu) = aoU) n — a2W n 2 + a^oj 



n—4 



(11) 



fi(oj) = aiu n 1 - a 3 cu n 3 + cl 5 lj 



n—5 



Now it is time to discuss the assumption of the previous section that the polynomial p 
has no roots on the imaginary axis. How can we check this condition? From ( TTOl) one can 
see that, for wGR, 



Thus, we need to use the Euclidean algorithm to find the greatest common divisor d = 
gcd(/o, /i). In the simplest case we get d = 1. Otherwise, we need to find out whether the 
polynomial d has real roots. 

Remark 14. As we will see later, the idea of using the Euclidean algorithm is extraordi- 
narily fruitful. Now we simply ran into it and risk to pass it by, not noticing that it is key 
to solving the entire problem. Can we at this stage guess, perhaps only feel, the value of 
this accidental idea to develop it afterwards? 

Assume that the polynomial p is continuously perturbed so that, at some moment, 
one or several of its roots intersect the imaginary axis, thus changing the values of n_ and 
n+ , which we are interested in. If we apply the Euclidean algorithm to the corresponding 
polynomials fo and /i, then its final result, the greatest common divisor d, will forget 
what happened. Most likely, d was equal to 1 and will again become equal to 1. But what 
if the memory of those events will be preserved in the by-product of the algorithm, which 
are usually thrown out as useless? Later we will see that this guesswork will be confirmed 
in its entirety. 

Still assuming that the curve T p does not go through zero, let us now consider how it 
intersects the imaginary axis. Let uj < u\ < ■ ■ ■ < oo m be the values of the parameter u 
for which the intersections occur. According to , oJk are real roots of the polynomial fo 
of odd multiplicity (so typically simple). Denote 

i k = lim sign— (p p {uj) {k = 0, 1, . . . , m). (13) 



In other words, i k = —1 (ik = +1) if the radius- vector of a point on the hodograph turns 
clockwise (counter-clockwise) when oj passes through the point u k (see the picture). 



PM = ^ /oH = /iH=0 ^ gcd(/ o ,/i)(a;) = 0. 



(12) 
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Lemma 15. 



71 



- (z fc _i +i k ) (k = 1, . . . ,m) 



Proof. For definiteness, suppose that u)k-i corresponds to an intersection of type 1 
on the picture. Then the next value u k corresponds to an intersection of type 2 or 3. In the 
former case, <p p \" k equals +7r, in the latter case, zero, which agrees with the statement 
of the lemma. The remaining three possibilities can be considered similarly. □ 



Lemma 16. 



77 



*0, 



77 



Proof. Since deg/o > deg/i, we have 



tan <fip(uj) 







(uj 



±oo . 



Consequently, the limit directions of the radius-vector are horizontal. 

For definiteness, suppose that the radius- vector approaches the direction of the positive 
real half-axis when u — > — oo. Then between — oo and ooq there must be an intersection of 



type 2 or 3 (see the picture). In the former case ip p 



71 



equals +— . In the latter case it is 



77 



equal to — — . This agrees with the statement of the lemma. The remaining possibilities 
must be considered similarly. □ 

Lemma 17. If p has no roots on the imaginary axis, then 



A 



p - 7T (l + l 1 H Y% r 



Proof. The increment is additive: 



(14) 



+ f P 



+ 



Vp 



+ fp 



+oo 



Therefore, the application of Lemma [TBI and Lemma [16] implies (fT4l) . 

Strictly speaking, we should also consider the case m = —1, i.e., the case when the 
hodograph does not intersect the imaginary axis at all. Using the same reasoning as in 
the proof of Lemma [HI we then will see that A p = 0. This is indeed the proper way to 
understand the formula (fT4l) in case m = — 1. □ 



The quantities ik and their sum occur in other applications. They have specific names. 
Let us consider the rational function 



R(u) = 



/o(w) 



(deg fx < deg f 



(15) 



where the numerator and the denominator are arbitrary real polynomials and are not 
necessarily determined from 

Let Uq < uj\ < ■ • ■ < oj m be the real poles of R of odd order. This means that R(uS) 
changes its sign as it «goes through oo» when uj goes through u^. 

The quantity 



Ind^R) 



+1, if R(u k -0) < < R(uj k + 0), 
-1, if R{u k - 0) > > R{u k + 0) 



(16) 



is called the index of the function R at its real pole u k of odd order. 
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The quantity 

lnd b a (R)= J2 ln <( R ) ( 17 ) 

k : a<uj^<b 



is called the Cauchy index of the function R on the interval (a, b). 
A comparison with (PT3l) and with the picture now shows that 



n_ — n + = Ind_^ 



z fe = Ind^ (i^), where R = — . 

Jo 

Theorem 18. Let the polynomial jEJ) have n_ roots in the left half-plane, n + roots in the 
right half-plane, and no roots on the imaginary axis. Then 

'fx 
Jo, 

where fo and f\ are defined in ([9]) . 

Proof. The theorem follows from Theorem [6] and from Lemma [T7l □ 

Remark 19. Undoubtedly, the case when the polynomial is stable deserves special con- 
sideration. We will devote to it a section of Chapter II. 

Remark 20. If we introduce the step function 
then (fTTl ) implies 

b 

lnd b a (R) = J dU{uj) = U(b-0)-U(a + 0). (18) 

a 

If we could calculate values of the function U(u) without calculating the roots Uj., we 
could conveniently apply Theorem | 



Problem 21. What is the Cauchy index of the logarithmic derivative 
of a real polynomial f ? 

Hint: It equals the number of distinct roots of the polynomial / in the interval (a,b). 

Problem 22. The choice of the imaginary axis for computing values of A p is most con- 
venient. But we could take any other line that goes through 0, except for the real axis. 
Prove this and explain the problem with the real axis. 

Problem 23. Let us draw a graph of the rational function R on the torus S 1 x S 1 instead 
of the plane R 1 x M. 1 . This is useful since one of the points on the circle S 1 must correspond 
to the point at infinity on the axis K 1 . Verify that such a «graph» is a closed curve on the 
torus. Which geometric (more exactly, topological) meaning does Indl^(-R) acquire? 

Theorem [T5] is a better tool than Theorem El However, we seem to go around 

IN CIRCLES: TRYING TO AVOID AN EXPLICIT CALCULATION OF THE ROOTS OF THE INITIAL 
POLYNOMIAL p, WE CAME TO THE NECESSITY OF CALCULATING THE ROOTS OF ANOTHER 
POLYNOMIAL / ! THE NEXT SECTION WILL SHOW THAT THIS IS IN FACT UNNECESSARY. 
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4. Sturm method 

In the book [19J, one can find an elegant theorem of Sturm on counting the number 
of real roots of a polynomial in a given interval. In fact, this theorem — or, more exactly, 
its method — has a wider scope. 

A finite sequence of polynomials {fo, fi, ■ ■ ■ , f n } is called a Sturm sequence on the 



interval (a, b) if 

/o(c)=0 (a<c<b) => /i(c)^0, (19) 

/„(c)^0, Vce(a,6), (20) 

/ fc (c) = (0 < fc < n, a < c < 6) / fc _i(c) / fc +i(c) < 0. (21) 



Given a Sturm sequence, let us introduce the integer-valued function V(x) defined to 
be the number of sign changes in the sequence {fo(x), fi(x), . . . , f n (x)}. The domain of 
this function is the interval (a, b) from which the roots of the polynomials in the sequence 
are excluded. At these points, the function V(x) can have a discontinuity of the first kind. 
However, V(x) does not have too many discontinuities: 

Lemma 24. If c £ (a, b) is not a zero of odd multiplicity of the initial polynomial fo, then 
V{c + 0) = V(c-O). 

PROOF. If f k (c) = 0, then k < n in accordance with pDl) . Now let fc > 0; then (I2B 
implies that fk-i(c) and fk+x(c) are nonzero and have different signs. Therefore, the 
subsequence {/fc_i(c), /k(c), /fc + i(c)} (0 < \x — c\ < e) contains exactly one sign change 
regardless of the sign of fk(x) for x in a small neighborhood of c. 

But if c is a zero of the polynomial f of even multiplicity, then fo(x) does not change 
sign in the punctured neighbourhood < \x — c\ < e. By (fT9l) . this also applies to fi(x). 
Thus, the subsequence {fo(x), fi(x)} has the same number of sign changes (0 or 1) to the 
left of the point c as it does to the right of c. □ 

Lemma 25. If c G (a, b) is a zero of fo of odd multiplicity, then 

K(c + O)-^(c-O) = -Ind c 0^. (22) 

PROOF. If the index at c is equal to +1, then the function /i//o changes its sign from 
— to + when x goes through the point c. The subsequence {fo(x), fx(x)} thus loses a sign 
change and V(x) decreases by 1. In case the index is negative, the opposite holds. □ 

Theorem 26 (Sturm). El 

Ind^|) =V(o + 0) -V(b-0). (23) 

Proof. Use the two previous lemmata and the fact that the full increment of a step 
function is the sum of its increments at points of discontinuity. 

V(b - 0) - V(a + 0) = l V ( c + °) - V ( c ~ °)]> 

where summation is taken over all discontinuities of the function V, that is, over all zeros 
of fo of odd multiplicity that lie in (a, b). □ 



12 We foresaw the existence of such a formula — see Remark [2Ql It would resemble (jTHJ) even more if 
we defined V(x) as the number of sign retentions rather than the number of sign changes. 
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Remark 27. Note that neither in the definition of Sturm sequences nor in the proof of 
the Sturm theorem was it essential that /o, . . . , f n are polynomials. Only the fact that 
these functions are continuous and have finitely many roots was used3- 

Let us consider an important special case of Theorem ES 
A Sturm sequence {/o, fx, ... , f n } is called regular if 

degf k = n-k (k = 0, 1, . . . , n). 

Theorem 28. Let h k be the leading coefficient of a polynomial ft- from a regular Sturm 
sequence {f , f u . . . , f n }. Then 



where v(ho, hi, ... , h n ) is the number of sign changes in the sequence {ho, hi, ... , h n }. 

PROOF. For large \x\, the sign of fk{x) coincides with the sign of its leading term 
hkX n ~ k . Therefore, 

V(+oo) = v(ho, hx,..., h n ), 

V(-oo) = v(h n , -hn-i, . . . , (-l) n h ) = n - v(h , h x ,..., h n ) 

(the latter quantity is the number of sign retentions, which, together with the number of 
sign changes in the sequence {ho, ■ ■ ■ ,h n }, sums up to n). □ 

It remains to discuss how a Sturm sequence can be constructed from an initial pair 
of polynomials {/o,/i}- For the case deg/ > deg/i, which we focus on, a modified 
Euclidean algorithm can be used. The modification consists in changing the sign of the 
remainder at each step: 

/fc_i = d k fk ~ fk+i, deg f k+1 < deg f k . (25) 

It does not change the original meaning of the algorithm since, at the last step, we will 
still obtain gcd(/ ,/i), but it guarantees that the condition (l2lil is satisfied. Regarding 
the conditions (fT9l ) and (1201 ). for this construction they are equivalent (prove it). 

Remark 29. This is not the only possible method for constructing a Sturm sequence. 

Remark 30. «Though the Sturm method is excellent in theory, it is not convenient 
in practice due to the enormous number of numerical coefficients of various powers of 
x in a sequence of Sturm functions when an equation of high enough degree is given. » 
(P. L. Chebyshev) 

Problem 31. How can we construct a Sturm sequence if deg fo < deg/i? 

Problem 32. Use the result of Problem {2l\ and suggest an algorithm for counting the 
number of distinct roots of a polynomial f on an interval (a,b). Use this algorithm to 
count the number of roots of the polynomial p on the imaginary axis. 

We have solved the Routh-Hurwitz problem, and not only for real but also 
for complex polynomials. we found out that this problem is algorithmic ally 
equivalent to finding the greatest common divisor of two polynomials and is 

THEREFORE RATHER SIMPLE. It TURNS OUT THAT, WHEN USED TO EXAMINE THE STABILITY 
OF REAL POLYNOMIALS, IT SIMPLIFIES FURTHER DUE TO THE SPECIFIC STRUCTURE OF THE 
INITIAL POLYNOMIALS (jTTJ) . THIS WILL BE TAKEN UP IN THE NEXT SECTION. 




(24) 



Thus, the Sturm method solves problems of algebra by means of analysis. As a result, it is disliked 
by both algebraists and analysts. 
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5. Routh scheme 

Let polynomials fo and f\ be defined as in ( TTTi ). If a% ^ 0, then the quotient and the 
remainder in (1251 ) are 

g?x (a;) = ecu = — j , (26) 

f 2 (u) = (a 2 - ca 3 ) ^ n " 2 - (a 4 - ca 5 ) c^"" 4 + ■ ■ • . (27) 

We see that, first of all, f 2 has the same structure as fo and /i, and if its leading coef- 
ficient is nonzero, then the same procedure can be applied to the pair {f\, f 2 }. Secondly, 
the coefficients of f 2 occur in the second row of the rectangular matrix 

/ «1 % «5 • • • 
V QiQ d 2 O4 . . . 

after the Gaussian elimination of the entry a . 

This is the basis for the computational Routh scheme. In textbooks and handbooks, 
the Routh scheme is usually described in a form suitable for computations by hand. 
Thanks to the progress of programming, mathematics now has new tools for recording its 
algorithms. Let us use standard Pascal. 

Let the coefficients of the polynomial (jSJ) be stored in the array var h: array [0. .n] 
of real; The transition from {/o,/i} to {fi,f 2 } described in (I26l)-(l27j) corresponds to 
the formal transition from the polynomial p to the polynomial 

ai z n ~~ l + (a 2 - ca 3 ) z n ~ 3 + a 3 z n ~ 3 + (a 4 - ca 5 ) z n ~ 4 + . . . , 

whereupon the process goes on provided that corresponding coefficients are not zero. This 
algorithm is realized by the routine Routh , which returns the logical value true after 
a normal completion; in this case, it places the leading coefficients ho, h%, . . . , h n of the 
Sturm sequence polynomials into the array h, to which Theorem 4.2 is then applied. 

function Routh(var h : array [0 .. n] of real) :boolean; 
var k, j : integer; 
c : real ; 

begin 

k:=l; 

while (k<n-l) and (h[k]<>0) do 
begin 

c:=h[k-l]/h[k] ; 

k:=k+l; 

j:=k; 

repeat 

h[j] :=h[j]-c*h[j + l] ; 

until j>=n 

end; 

Routh :=(k=n-l) 
end {Routh}; 

After the kth step, «the tail» [k. .n] of the array h is filled with the alternating 
coefficients of the polynomials {fk, fk+i}, but «the head» contains the leading coefficients 
of all preceding polynomials of the sequence as some sort of «useful trash». 

We begin our analysis of the algorithm by criticizing it. If Routh (h)=f alse, then 
we will only know that the given polynomial generates a nonregular sequence, but the 
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question of where its roots are located will remain open. This is not a dead end since the 
original Sturm method can work with these nonregular situations too. We will not go into 
details, since all is well in the case of interest to us: 

Theorem 33. A polynomial is stable if and only if Routh(h)=true and ho, hi, ... , h n 

are not zero and of the same sign. 

Proof. By Theorems [T8l and l26l stability implies V(— oo) — V(+oo) = n. On the other 
hand, since V(— oo) < n and V(+oo) > 0, this must be the extreme case V(— oo) = n, 
V(+oo) = 0. The former equality shows that the length of a Sturm sequence is maximal 
and equal to n+1; such a sequence is regular. The latter equality implies that all h k are 
of the same sign. Hereby the necessity is proved. Now, Theorems [28] and [18] imply the 
sufficiency □ 

Here comes a pleasant surprise of the algorithm^. As Routh was developed, it was 
assumed that the polynomial has no roots on the imaginary axis and that it generates a 
regular Sturm sequence 

f k (z) = h k z n ~ k + ■■■ (h k ^0, k = 0, 1, . . . , n). 

But Routh does not require division by the last two coefficients h n and /i n _i, so these 
coefficients, unlike the rest, can take the value zero. Thus, the scope of Routh is wider 
than originally intended. This property of the algorithm comes in handy. The point is 
that the loss of stability of a fixed point of the system (jTJ0 is accompanied by effects 
determined by precisely how the eigenvalues of the matrix A leave the left half-plan^. 
The following two scenarios are most common: 

• a simple real eigenvalue crosses the imaginary axis at the point when it enters 
the right half-plane; 

• a pair of simple non-real eigenvalues crosses the imaginary axis at points ±ico 
(uj^O). 

The Routh algorithm can distinguish between these two variants: 

Theorem 34. Let Routh (h)=true. Then, 

a) ifh n -\ ^ 0, h n ^ 0, then the polynomial has no roots on the imaginary axis, and 

n + = v(h , . . . ,h n ), n- = n-v(ho,...,hn); (29) 

b) if hn-i ^ 0, h n = 0, then the polynomial has one simple root on the imaginary 
axis at the point 0, and 

n + = v(h ,...,h n - 1 ), n- = n — l — v(h ,...,h n - 1 ); (30) 

c) if h n _i = 0, h n _ 2 h n < 0, then the polynomial has no roots on the imaginary axis, 
and 

n + = v(h ,...,h n _ 2 ) + l, n- = n-v(h Q ,...,h n - 3 )-l; (31) 



Why keep talking about sad things? 

«... ?ojnx i'x jn'Vn njn nsa oyaj? -pr tnyn oxn : Dijnj^'ns oyaj? iia pjn nyojn Tmb 

[Let us talk of something amusing. What is the news about cholera in Odessa?] » (Sholom Aleichem) 
15 Under perturbation of the system (Q]) and therefore of A [translators' remark]. 
16 A new fixed point or a limit cycle can branch from a fixed point that loses stability. Bifurcation 
theory studies such phenomena. 
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d) if h n -i = 0, h n - 2 h n > 0, then the polynomial has two simple roots on the imagi- 
nary axis at the points ±ico (u ^ 0), and 

n + = f(/to, ...,h n -z), n- = n - 2 - v(h , . . . , K- 2 )\ (32) 

e) if h n -i = 0, h n = 0, then the polynomial has one double root on the imaginary 
axis at the point 0, and 

n+ = v(h , ...,h n - 2 ), ri- = n-2-v(h ,...,h n - 2 ). (33) 

Proof. The last three polynomials of the Sturm sequence are: 

fn-2{x) = h n _ 2 X 2 ~ h n , f n -l(x) = K-lX, fn{x) = h n . 

In case (a), the sequence is regular. In the remaining cases, one or both polynomials 
fn-i, fn are identically zero. Recall that we deal with the Euclidean algorithm, therefore 
the construction of fk must be stopped as soon as there is division without remainder 
in ( f25l) : then the last nonzero polynomial fk is the greatest common divisor of the initial 
polynomials up to a numeric factor. We have: 

d{uo) = f n -x(u) — to in case (b), 

hn-i 

1 h 

d(u) = f n -i{oj) = uj 2 - in cases (c), (d), (e). 

The statements of the theorem about the number and the location of roots of p(z) on 
the imaginary axis follow from (fT2l ). It suffices to verify (l29l)-(l33l. 

In the «safe» case (c), the last polynomial f n - 2 is not equal to zero on the real axis, and 
the system {/ , fi, ■ ■ ■ , /n-2} is a Sturm sequence. Therefore we have to use Theorem [26] 
instead of Theorem EH 

Cases (b), (d) and (e) require correction of the initial polynomial p(z). It should be 
divided by i l d(—iz), where I = degd, in order to remove its purely imaginary roots. At 
the same time, fo and f\ should be divided by their greatest common divisor d. Their 
leading coefficients ho and hi do not change since we normalized d beforehand. Therefore 
the corrected polynomial requires no recalculations. We just have to take into account 
that its degree has decreased and to reuse Theorem [281 □ 

Remark 35. A substantial difference between h n and h n _i is that the last component of 
the array does not get processed. It remains equal to the last term a n of the polynomial 
p. On the contrary, the penultimate term of the array is subjected to the largest number 
of arithmetical operations. 

Problem 36. Run Routh manually for polynomials of degree 3 and 4 and find necessary 
and sufficient conditions for the stability of these polynomials. 

Problem 37. Program the routine 

function IsStable(h: array [0. .n] of real) : boolean; 

that inputs the array h of coefficients of a real polynomial p and returns true if the 
polynomial p is stable or false otherwise. Program also the routine IsStableStodola 
which differs from Is St able by testing a polynomial first for the Stodola condition. Is 
such an improvement of Is St able useful? 

Problem 38. The following source code is written in APL (see [12JJ: 



We recommend that the reader give a proof on his/her own. 
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V B«- Whatlslt A 

LOOP: ((l=ljpA) V (A[2;l]<0)) /EXIT 

A^(2 2p0 1 1 -A[l;l]-=-A[2;l]) +.X (0 1 1 A) 

-►LOOP 
EXIT: B«-(A[2;1]>0) 

V 

The input parameter A is a matrix of the form (128|) . Which algorithm does the function 
Whatlslt realize? How can it be improved? Give a comparative analysis of Pascal versus 
APL as algorithmic languages. 

so we now have at our disposal a simple (possibly unimprovable) algorithm 
for testing the stability of a real polynomial. we just have no explicit formulae 
that express the output of this algorithm. for comparison, the gauss algorithm 
and the Cramer formulae are complementary in the theory of linear systems: 
the former describes how to find a solution, the latter how this solution looks. 



Given a polynomial 
the following structure: 



6. Hurwitz theorem 

with real coefficients, consider a corresponding nxn matrix of 



Hp = 



(a\ a% a$ a-j 

CLq Oil CL4. QiQ 

a\ a 3 a 5 

a a 2 04 

ai a 3 



(34) 



(the coefficients a , . . . , a n are not enough to fill the rows, but we set a n+ i = a n+2 = ■ ■ ■ = 
0). The matrix Ti p is called the Hurwitz matrix of the polynomial p. Let us denote by rjk 
the leading principal minor of this matrix formed from the first k rows and columns. It is 
easy to check that there is only one (diagonal) nonzero term in the last column of Tt p . It 
equals a n . Therefore 

(35) 



Lemma 39. In the regular case, 

Hi 1 ; / 



h = 771, h 2 



Vn-l 



(36) 



Proof. First note that the matrix TC P consists of blocks of type (l28l) . which are in 
turn made up from the coefficients of the polynomials fo and f\ . Let us reduce the matrix 
Tip to upper triangular form using Gaussian elimination without pivoting. By (l26l) -f|27l). 
the elimination of the term a from the second, fourth etc. rows leaves these rows filled 
with the coefficients of f 2 , the next polynomial of the sequence. The first row and column 
are no longer needed. Crossing them out (temporarily), we obtain an (n — 1) X (n — 1) 
matrix of the same «Hurwitz» structure formed from the coefficients of the polynomials 
fi and f 2 - Repeating the same procedure, we will arrive at a triangular matrix 

fh x 





* 

h 2 





* 
* 



\0 



* 



hnJ 
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whose kth row consists of the coefficients of the polynomial f k , starting with the leading 
coefficient h k on the main diagonal. 

Each elementary transformation that we applied consisted in subtracting the preceding 
row (multiplied by a suitable number) from a given row. Such transformations preserve 
not only the determinant \7i p \ = r] n but also all leading principal minors r] k . Consequently, 

Vk = hh 2 ■ ■ ■ h k (k = 1,2, . . .n), (37) 
which implies (l36l ). □ 
Theorem 40 (Hurwitz). A polynomial 

p(z) = a z n + a\Z n ~ x + V a n (a , Oi,...,a n £ R; a > 0) (38) 

is stable if and only if all leading principal minors of its Hurwitz matrix Ti. p are positive: 

Vk >0 {k = l,2,...,n). (39) 

Proof. The assertion of the theorem follows immediately from the preceding Lemma 
and Theorems [331 and [341 □ 



Remark 41. Let us assume that the polynomial p depends continuously on one or several 
parameters and is initially stable. Lemma 6.1 and Theorem 5.2 imply that its stability is 
maintained for as long as \H P \ = r] n is not zerc0. At the same time, owing to (1341) and to 
Theorem 5.2 again, a sign change in a n (for ?7„_i ^ 0) corresponds to the transition of a 
simple real root from the left to the right half-plane. A sign change in r/ n _i (for a n ^ 0, 
Vn-2 7^ 0) corresponds to the transition of a pair of simple complex conjugate roots from 
the left to the right half-plane. 

Remark 42. The observation we just made appears useful: in order to «catch» the 
moment when a continuously varying polynomial loses stability, it is sufficient to check 
only the sign of the last minor r\ n . This is, however, no reason to celebrate: Theorem [331 
implies that, while the polynomial remains stable, the determinant \Tl p \ is best computed 
using the compact Gauss scheme, which yields all the leading principal minors r\ k as 
a by-product anyway. In general, it makes no sense to apply the Hurwitz theorem for 
computations - to this end one should use the Routh scheme. 

Remark 43. In 1914, A. Lienard and M. Chipart proposed a different criterion of polyno- 
mial stability. They established that a polynomial (l38l of degree n with positive coefficients 
is stable if and only if the following conditions are satisfied: 

V2 > 0, ?7 4 > 0, . . . ,7] n -i > 0, if n is odd, 

(40) 

rji > 0, 773 > 0, ... , r\ n -\ > if n is even. 

The Lienard-Chipart conditions (t40l) look simpler than the Hurwitz conditions (|39l) 
since they contain half that many determinantal inequalities. Although the simplicity 
is misleading from the computational point of view (for reasons given in Remark [42]) , 
conditions (1401 ) may be more useful for formal derivations, and the equivalence of (T39T ) 
and ( [401 is very interesting from the theoretical point of view. 

Problem 44. Assuming the minors r)i,...,T] n are all nonzero, prove that the number 
of zeros of the polynomial p in the right (left) half-plane is equal to the number of sign 

variations (retentions) in the sequence < do, 771, — , 



rji Va-l 



18 Unlike the fish, the Hurwitz conditions «rot from the tail». 
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Problem 45. Let allr] k be nonnegative. Does this imply that all roots ofp lie in the closed 
left half-plane? 

Hint: No, it does not. Give a counterexample. 

Problem 46. Prove that the statement of the Hurwitz theorem remains valid if 
ao, di, 02, . . . in (l34l ) are the coefficients of the polynomial 

p{z) = a + a\z + a 2 z 2 + ■ ■ ■ (do, fli, . . . G R; a > 0). (41) 

Hint: if p(0) ^ 0, then the polynomials p(z) and q(z) = z n p(z~ x ) (n = degp) are 
simultaneously stable or unstable. 

Problem 47. Ifp(z) is an analytic function represented by the power series (!1T|) . then we 
can formally construct its infinite Hurwitz matrix and require the positivity of its leading 
principal minors. Give an example showing that the Hurwitz theorem does not generalize 
to analytic functions. In this connection, see |27L Q], [7]. 

Our goals are achieved, and we are done with the basic material. In Chapter 
II we will consider many variations on the topic of stable polynomials. That 

OPTIONAL MATERIAL CAN BE USED FOR SEMINARS AS WELL AS FOR INDEPENDENT STUDY. 



tio J1 



Appendix to Chapter I 

Here we consider the stability problem of the Lorentz system posed in the Introduc- 

X = aY-aX, Y = rX -Y - XZ, Z = XY-bZ. (42) 



1. Determining fixed points. A fixed point of the system (jj2j ) is a point (Xk, Y k , Zk) G 
R 3 that annihilates the right hand sides in (|42l . i.e., a solution to the following system of 
algebraic equations: 

aY k - aX k = 0, rX k -Y k - X k Z k = 0, X k Y k - bZ k = 0. (43) 
One of the solutions to (T43l is easy to spot: it is the zero fixed point 

(X ,Y Q ,Z ) = (0,0,0). (44) 
In addition, the system (1431) has two more solutions 



{X 1>2 , Y 1>2 , Z 1>2 ) = (±y/b(r-l), ±^b(r-l), r - 1). (45) 

Remark 48. The zero fixed point exists for all positive values of the parameters a, b, r. 
A pair of nonzero fixed points f T45l) bifurcates from it when r > 1. 



2. Linearization. In order to linearize the system x = f(x) in the neighborhood of a 
fixed point x k (where f(x k ) = 0), one has to replace the function f(x) by its linear form 
A k (x — Xk) where A k = f\x k ) is the Jacobian matrix consisting of the partial derivatives 
of the function f(x) taken at the point x k . In our case, 

/ -a a \ 
A k =\ r-Z k -1 -X k {k — 0, 1, 2). 

V Y k X k -b J 



'We remind the reader that the parameters b, r, a are assumed to be positive [translators' remark]. 
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3. Computing the characteristic polynomial. We have 

p fc (A) = \XI-A k \ = (X + a)(X + l)(X + b) + aXl + a{Z k -r)(X + b)+Xl{X + a) 

since Xk = Yk for all three fixed points. 
For the zero fixed point (l44l ). we obtain 

Po(A) = (A + 6)[A 2 + (a + 1)A + (7(1 - r)]; 
for the fixed points | !4"5l) . we get 

Pi,2(A) = cioA 3 + a-iA 2 + a 2 A + 03, where 
do = 1, ai = cr + b + 1, a 2 = b(a + r), 03 = 2ab{r — 1). 

4. Stability of fixed points. The Hurwitz theorem is in fact not needed to analyze 
the stability of the polynomial po - the elementary Vieta's Theorem is enough. This 
polynomial is stable for < r < 1. 

Remark 49. At r = 1, one of the roots of the polynomial po crosses the imaginary axis 
and enters the right half-plane. The zero fixed point loses its stability, and exactly then 
the fixed points (1451) bifurcate from it. 

For the polynomials pi )2 of degree 3, the Hurwitz matrix has the form 

a 3 
a 2 

and the conditions of Theorem [40] say 

a 3 > 0. 

As a\ > 0, 03 > for r > 1, there remains just one inequality 

aia 2 — cio a 3 = b(a + b + l)(cr + r) — 2ba{r — 1) > 0, 
which can be viewed as an answer. However, it is better to represent the answer as follows: 



>0, 




a 3 


>0, 


ai 


a 3 




a 


a 2 




a 


a 2 



1 < r < r* where r* 



a £±|±| a >b + l, 
+00 otherwise. 



Remark 50. The fixed points that bifurcate from (0, 0, 0) are initially stable. When the 
parameter r reaches its critical value r*, they lose their stability since, by Theorem [34] 
(also see Remark [44]), a pair of nonreal roots of the polynomial pi j2 crosses the imaginary 
axis. One can in fact show that, at r = r*, limit cycles branch from the fixed points that 
lose stability. Further discussion of these difficult and interesting questions is beyond the 
scope of our notes; these questions are subject of bifurcation theory. 



CHAPTER 2 



Extensions 




"Voila le sujet simplifie, argumentum omni denudatum ornamento. Je ferais 
avec cela, continua le jesuite, deux volumes de la taille de celui-ci. " 

Et, dans son enthousiasme, il frappait sur le saint Chrysostome in-folio 
qui faisait plier la table sous son poids. 

D'Artagnan f remit. 

Alexander Dumas. Les Trois Mousquetaires. 

Chapitre XXVI. La these d'Aramis. 
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7. Stieltjes Fractions 



The definition of the Cauchy index Ind_^(i?) given in §[3]presupposed that the rational 
function R vanishes at infinity. Let us try to generalize this notion to arbitrary rational 
functions. A useful idea is contained in Problem [23) the point at infinity oo should be 
«made equal» to the other points, and it makes sense to consider the function R as a 
map of the projective line^ PR 1 = R 1 U {oo} into itself. In particular, if R = /i//o and 
deg/i > deg/o, then we will view R as having a pole at oo of order deg/i — deg/o. If 
that pole is of odd order, then, analogously to ([TBI) (§[3]), we let 



Indeed) 



+1, if #(+00) < < R(-oo) 
-1, if #(+00) > > R(-oo), 

in all other cases, let Indoo(i2) = 0. This summand should be added to ( TT7l ) (§02): 
Ind PR (f2) = Ind+^(i?) + Ind^R). 



(46) 



(47) 



Remark 51. Incidentally, (1461) implies that rays (C, +00) should be viewed as left half- 
neighborhoods of the point 00, and rays (—00, — C) as its right neighborhoods. The pro- 
jective line is in one-to-one correspondence with the circle S 1 , as illustrated below: 



00 




Indoo(d) 



(48) 



Remark 52. Let d{u) = cu u + ■ ■ ■ {y — degci) be a polynomial. Then 

• sign c, if v is odd, 
0, if v is even. 

Now note how the index IndpK(i?) changes when various «projective» maps act on R. 

Lemma 53. If d G M is a constant, then IndpR(d+ R) = IndpR(-R). 

Proof. The addition of a constant does not change the behaviour of a function near 
its poles. □ 

Lemma 54. If d is a polynomial and R(oo) = 0, thenlndp^(d+R) = Ind^(-R)+Indoo(<i). 

Proof. See ([46l) ([471) . □ 

Lemma 55. Indpnj (— jj;) = IndpiR(-R). 



For sure, the point is not just to add a new element to the set K , we need to incorporate that 
element into the algebraic and topological structures that exist in K 1 . More about projective spaces PM" 
can be found in I13j . 
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Proof. The projective line PR 1 is divided by its two points and oo into its positive 
M + and negative M~ rays. Let a variable uj traverse PR 1 and return to its starting point. 
Clearly, the number of crossings from R~ to M + must equal the number of reverse crossings. 
The crossings through oo occur at the poles of the function R, and they are accounted 
for in the sum (1471 ) with the appropriate sign. The crossings through occur at the zeros 
of i?, i.e., at the poles of ^, and they are accounted for in the analogous formula for 
Indp R (^). As a result, Ind PR (^) + Indp K (i?) = 0. □ 

Remark 56. The transformations uj i— > uj + d are called shifts of the projective line PR 1 . 
The point — - is referred to as the polar of uj G PR 1 . If the diameter of the circle on Fig. 2 
is equal to 1, then the polar B of the point A is constructed by drawing the perpendicular 
BC to AC. By a theorem from elementary geometry, OA ■ OB = OC 2 . Another theorem 
(on a subscribed and a central angle) implies that A'B' is a diameter. 




Alternating between shifts uj i— > uj + d and the polar transformation uj \— > —-, we 
obtain a continued fraction. The right-hand sides of the following formula are examples 
of continued fractions: 



Lemma 57. // a>5 — = 1, then 



< 1 

7 



7 



au + (3 
■yu + 5 



1 

1 



ay — 



7 



(a(3 + a) — 



a 



oo 



(7 + 0), 



(7 = 0). 
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Lemma 58. // a8 — [3y = 1, then 

1 



au + (3 
■yu + 5 



(7 ^ 0), 



7 



7 



7 



oey — 



S 

- + u 

7 



(49) 



(aj3 + a) 



(7 = 0). 



a — 



a: 



PROOF. By straightforward calculation. 

The fractional-linear map 

auj + (3 



□ 



+ 5 



a,/?, 7, 5 G R; a5 - £7 = 1) 



performs a projective transformation PR 1 — > PR 1 . Such transformations form the ^roujH 
denoted by SL(2,R). 

Theorem 59. The Cauchy index is invariant under the action of the group <SX(2,R) on 
rational functions: 



Ind PM ($ o R) = Ind PR (i2), V$ G SL(2, R). 
Proof. Apply Lemmata [ 



□ 



Now let us explore connections between projective geometry, continued fractions, and 
the Euclidean algorithm. For definiteness, suppose that R(oo) = 0, so that R = /1//0, 
deg f\ < deg fo in Sections 3 and 4. Run the modified Euclidean algorithm, which we used 
in §[4] to construct Sturm sequences: 



fk-i = d k f k - f k+1 (k = 1, . . . , m; deg f k+1 < deg f k ; /, 



m+l 



0). 



If 



Ru = 



fk+1 

fk 



(k = 0, 1, . . . , m; Rq = R, R m = 0), 



then (1501 ) yields the recurrence relation 

1 



—Rk-l 



d k — R k 



(50) 



(51) 



(52) 



2 One can find information about this group - and other things - in [3[ §5]. 
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and we obtain the following expansion of the rational function —R into a continued 
fraction: 

-R = ^ , (53) 

dx 



do 



1 

dm 



A functional continued fraction of type ( |53l ). where di, . . . , d m are polynomials, is called 
a Stieltjes continued fraction (see [TTl EM l29l l30l I31]). 

Theorem 60. If a rational function R is represented by a continued fraction (I53L then 

m 

Ind PM (i?) = -J2 Indoo(4). (54) 

k=l 

Proof. Apply Lemmata [54] and [55] inductively, using the relation ([52]) . □ 

Remark 61. Theorem l60l shows a way to compute Cauchy indices, which parallels Schur's 
method. It uses the same (Euclidean) algorithm, but its validity is established by different 
reasoning. 

Now consider the extreme case when the function R is generated by a stable polyno- 
mial, as described in §[3j It is time to give such functions a name. We will call a rational 
function R — fi/fo proper if deg fi < deg fo and IndpR(-R) = deg fo] the class of all proper 
functions will be denoted by 1Z. 

Theorem 62. R ElZ if and only if 

-R(u) = 1 , (55) 

aiu + Pi 

a 2 uj + P2 



1 



a n uj + p n 



where Pi, . . . , P n G K., «i, . . . , a n > 0. 



Proof. Formula f[50l) implies that n = Yuk=i ^eg^fci whereas (T48l) implies the inequal- 
ity — Indoo(rf) ^ deg d, where equality occurs if and only if d{u) = au + P (P G R, a > 0). 
A comparison with (1541) now shows that the polynomials di, . . . , d m in (l53l) must satisfy 
exactly these conditions, and their number must be exactly n. □ 

Remark 63. The transformations z t— > az + P (P G R, a > 0) and z 1— > — map 

the complex upper half-plane {z G C : Imz > 0} into itself. Therefore, this property is 
inherited by the function —R if it admits an expansion of type ( 1551) . This theme will be 
taken up again in §[8], but our next problem already has a hint of a variation on this idea: 

Problem 64. Let 

R(u) = j , (56) 

aiuj + Pi -\ j 

a 2 uj + p 2 H j 

'"• + 7T 

a n u + p n 



28 

where ack, (3k > (k = 1,2, . . . , nj. T%en ifte function R maps the right half-plane {z G 
C : Kez > 0} mio itself; all its poles and zeros must lie in the left half-plane. 

Problem 65. Consider a tridiagonal matrix 



A 



Prove that 

• all eigenvalues of A are real and simple whenever b 2 c 2 , ■ ■ ■ , b n c n > and 
oi, . . . , On G 1; 

• all eigenvalues of A lie in the open right (resp., left) half-plane whenever b 2 c 2 , 
. . . , b n c n < and a\, . . . , a n > (resp., < 0). 

Hint: Consider the rational function R(X) = A n _i(A)/A n (A) where A n (A) denotes the 
determinant of the matrix XI — A and A„_i(A) denotes its principal minor obtained by 
omitting its last row and column. For this function, obtain a decomposition of type (1551 ) 
or ( 1561) and use the idea from Remark [63] and Problem EH 





c 2 


.. 





\ 


b 2 




c 3 . . 











h 


a 3 . . 














.. 






\o 





.. 







Problem 66. Let A : 03 — > 03 fre a bounded linear operator on a Banach space 03, /ei 
u 6 03 be a vector in 03, and let G 03' 6e a bounded linear functional acting on 03 swc/i 
£/ia£ <^(m) 7^ 0. We introduce 

• the subspace 03i = {x G 03 : (p(x) = 0} C 03, 



ip(Ax) 
ip(u) 



u, 



• the operator A\ : 03i — > 03i, A\ : x (—>• Ax 

• tte vector u\ G 03i % «i = Au — u, 

• the functional ipx G 03 1; y?i : x 1— > <^(Ar), 
and consider the two analytic functions 

R(X) = <p ((XI - A)- l u) , R x {\) = Vl ((XI - A,)-^) 
Prove that 



-R(X) 



where St = (f(A h u) (k = 0, 1). 



As - si - Ri(X)' 
Consequently, if dim 03 < 00 ; then Indl^(i2) = signs + Indl^(i?i). 
Problem 67. Prove that the projective transformations 



$ T : PR 1 -> PR 1 , $ T : 



re 



(57) 



1 — ru; 

/orm a one-parameter group, i.e., 

Problem 68. Using the notation of Problem[6b\ consider 

A t : 03 -> 03, A t : x ^ Ac + 1 </>(x) u, i? t (A) = ip ((XI - A t )~ x u) , 
and let $ r 6e defined as in ( 1571 ). Prove that Rt+ T = & T °Rt- Consequently, if dim 03 < oo, 



Ind^(i?t)=Ind^(i? T ) (Vt,rG 
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Problem 69. Theorem{5E dealt with the left action of the group SL(2,M), where the 
group acts on the value of the function R. Prove that IndpR(-R) is also invariant under 
the right action of the group SL(2,M), where the group acts on the argument: 

Ind PM ( J R o $) = Ind PR ( J R), V$ e SL(2, R). 

Problem 70. If polynomials fo and f\ are as in $5| then the function R = /i//o is odd, 
and Routh 's algorithm, when it halts, yields an expansion of R into a continued fraction 
of the following form: 



R(lu) 



1 



C\0J — 



c 2 oj - 



1 



Prove that the following expansions are valid as well: 

1 



R(u) 



1 



in 



2k -1), 



Cl 



c 2 or - 



c 3 



C2fc-1 



in 



2k). 



C-2 



C 3 UJ Z 



1 

C2k 



Problem 71. The expression 



{R,z} = 



R"'(z) 
R'(z) 



R"{z) 
R'(z) 



is called the differential Schwarz invariant, or the Schwarzian derivative of the function 
R (see |15| ). Prove that 

{$ o R, z) = {R, z) V$GSI(2,C). 

If R(z) = s z + SiZ 2 + s 2 z 3 + • ■ ■ , prove that 



{J2,0} = i 



5 Si 

51 s 2 



8. Hermite-Biehler Theorem 



Let a polynomial 

p(z) = a z n + a\z n ~ x H V a n (a > 0; m, . . . , a n G R) 

be stable. According to Theorem 3.1, the function R = fi/ fo, with real polynomials f 
and fi that are defined by 

r n p(iu;) = f (uj) - if x {w) (uj E R), (58) 
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is proper: Ind^(-R) = deg/o = n. Since the sum (fT7j) (or, equivalently, ([471) ) has no more 
than n terms ±1, this equality is possible only if the number of terms is exactly n, and 
they are all equal to +1. Therefore, the polynomial fo necessarily has n distint real roots 
u>x < ■ ■ ■ < u ni and since deg/o — n , it cannot have additional (nonreal or multiple) roots. 
Hence R splits into elementary fractions as follows: 

n 

R{z) = V a k = Res^R). (59) 

k =i z ~ Uk 

By ([TBI) . Ind^i?) = signer, and all indices are equal to +1 in our case, so all residues 
a k must be positive. Thus 

Im R(z) » a k 

—j =-Li (Imz^O), 

Imz fc=1 \z - u k \ 2 

dR(z) ™ a fc 

— i =-L7 lmz = 0. 

The function —R therefore maps the upper half-plane {z: Imz > 0} into itself, 
and monotonically increases between its consecutive (real) poles. So, between any two 
consecutive roots u^-i, uj k (k — 2, . . . , n) of the denominator f there must lie exactly one 
(simple) root of the numerator fx. Since deg/i < n — 1, the polynomial /i cannot have 
any additional roots. 

Furthermore, the formula ( [581 ) is algebraic, so it does not matter that the argument uj 
was initially assumed to be real. This formula may be re-written as 

r n p(z) = f (-iz) - ifx(-iz) (z G C). (60) 

Hence 

, lm(—iz) ^ 

p(z) = => R(-iz) = -i — ^ — f = Rez<0. 

Im(— z) 

The polynomial p is stable! We now summarize this walk «there and back again»: 



Theorem 72. Given a polynomial p, let polynomials fo and fx be defined by (1581 ) . and 

let R = fx/ fo- The following conditions are equivalent: 

(1) the polynomial p is stable; 

(2) the function R is proper; 

(3) the function R admits a representation of type ([59l . with u; fc G IR and > 
(fc= l,...,nj; 

(4) ifte function —R maps the upper half-plane into itself; 

(5) the roots of the polynomials fo, fx are real and simple, between any two consecutive 
roots of fo there is exactly one root of fx, an<^ 

3ueR : /iM/oM-/oM/i(w)<0. (61) 

Remark 73. The statement that conditions (1) and (5) are equivalent is called the 
Hermite-Biehler theorem. The two mathematicians obtained this result simultaneously 
(1879) and independently - see 1 1 1 L I27L I18| ; analogues for entire functions are given in 

m 



3 The last condition is a boring add-on. The pair fo, fi should be normalized so that the ratio /i//o 
be an decreasing rather than an increasing function over E. 
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Proof. We already know that 

(1) <=► (2) 

(4) <= (3) =► (5) 

To prove the implication (3) •<= (5), note that the reality and simplicity of the roots of 
fo imply the possibility of a decomposition of type (l59l) . Next, if two consecutive residues 
and a k were of different sign, then the interval (uJk-i, <^k) would contain an even 
number of roots of R. All residues a k are therefore of the same sign, namely positive, in 
view of (EI). □ 

Remark 74. Analytic functions that map the upper half-plane into itself are well studied. 
They play an important role in the spectral theory of self-adjoint operators (see [2], I16L 
I17|). The generic representation of such a function is 

+oo 

F(z) = az + /3 + I ^^d9(oo) (Imz^O), (62) 

J LU — Z 

— oo 

where a > 0, (3 G R, and 9(uS) is a nondecreasing function with finite limits 0(±oo). The 
function 9 has only a finite number of growth points if and only if R = —F is a proper 
rational function. 

Problem 75. Prove that the logarithmic derivative of a polynomial f with (not necessarily 
real) roots u>i, . . . ,u n satisfies (1591 ). where the residues are equal to the multiplicities 
of the roots ujj,. 

Problem 76. Without recourse to f|62l) . prove that a real rational function F maps the 
upper half-plane into itself if and only if 

n 

F(z) = az + (3 + V ^ , where a > 0, (3 E R, a k > 0, uj k eR. 

k=l 

Problem 77. Let A be a bounded self-adjoint operator on a Hilbert space 9j, let u G 9), 

and consider 

R(X) = ((XI - A)' l u,u) . (63) 

Prove that 

Im^(A) = 
Im A 

and therefore —R maps the upper half-plane into itself. 

Problem 78. Assuming that the operator of Problem [73 is continuous, prove that the 
corresponding function (l63l admits the decomposition 



R(\) 



\{u,e k )\ 



2 



where {ei,e2, ...} is an orthonormal basis consisting of eigenvectors of A, and 
{ui,u 2 , . . . } are the corresponding eigenvalues. 

Problem 79. Let R be a rational function that vanishes at infinity. Then it admits a 
Laurent series representation that converges for sufficiently large \z\: 
t}( \ s Si s 2 
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Prove that R is proper if and only if the coefficients sq, s±, S2, ■ ■ ■ satisfy 

n 

Sk = a j ljjk j f or some a%, . . . , a n > 0, U\ < • • ■ < uo n . (64) 
i=i 

ImR(z) 

Problem 80. If R G U, then — — > \R'(z)\ (Imz > 0). 

Im z 

Let 7 be a smooth curve lying in the upper half-plane 

11 = {z G C| \mz > 0}. 

Define a non- Euclidean curve length 7 by the formula 5(7) = J^(Imz) -1 |dz|. This makes 
the half-plane II into the Poincare model of the Lobachevsky plane (see [9], I14|). The 
geodesies of this plane are half-circles with center on the real axis E, and vertical rays. 
The Lobachevsky-Poincare plane is shown on the front page in a somewhat stylized form. 

Problem 81. If R EH, then the map —R : II — > H decreases curve lengths on IT: 
s(—R 07) < 3(7). 

az + 3 

Problem 82. let R G 1Z, a, 3,^,5 G K. TTien £/ie equation R(z) = - 

•yz + 

• /las on/t/ real solutions if ad — ^7 > 0; 

• has no more than one pair of complex conjugate solutions if ad — 3^y < 0; 

Problem 83. View the function p(z) = e z as being analogous to a polynomial. This 
entire function has no roots in the right half-plane (in fact, no roots whatsoever) so can 
be thought of as «stable». Can one apply the results of Theorem\7^ at least to some extent, 
to this function? Without succumbing to premature enthusiasm, consider also the function 
p(z) = e~ 



-z 



9. Hankel forms 

Given a real rational function^ R, let us associate with it a sesquilinear forn% H 
defined on the (infinite-dimensional) complex linear space V of all polynomials by the 
formula 



h(x, 2/) = y R (0 <0 v(0 d C & yen 



(65) 



Here y(() = y((), and 7 is a positively oriented closed contour enclosing all poles of the 
function R. By the Cauchy residue theorem, 

H ( x i y) = ^2 Res ^ ( Rx y) ( 66 ) 

where the summation is over all poles of R. 

A residue is especially easy to compute at a simple pole: 

7A AH 



fM = 0, /» + 0, h{uS) ± Res w (^j 



4 In contrast to §[3 where a rational function was considered as a map on the real projective line, 
here we consider it as an analytic function of a complex variable. 
H < — Hermite, Hankel, Hurwitz. 
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So, if all poles u\, . . . , uj n of the function R are real and simple, then 

n 

H(x,x) = ^^ak\x(cuk)\ 2 , where = Res ulk (R) E R. (67) 

k=l 

We thus reduced the Hermitian form H(x, x) to a sum of squares, and formula flBTl) shows 
that the rank rank if of this form (i.e., the total number of squares) is equal to n, the 
number of poles of the function R, whereas its signature sign ii (i.e., the difference between 
the number of positive and negative squares) is equal to Indl^(-R) (Hermite 1856). 

When the function R has multiple or nonreal poles, one fails to find such an expres- 
sive formula as (1671) . However, the qualitative connection remains valid: rank ii = deg/ , 
sign if = Indl^(i?) (Hurwitz, 1895). Let us try to penetrate the essence of this phenom- 
enon. 

Given a polynomial g G V, consider the subspaces 

V 9 = {gu : ue V}, V g = {veV: degv < degg}. 

The restrictions of the form H to these subspaces will be denoted by H 9 and H g , respec- 
tively. Long division of polynomials (x = gu + v, degv < degg) shows that@ 

V = V g © V g => codim V 9 = dim V g = deg g. (68) 

Remark 84. In addition to being a linear space, V is a commutative algebra: its elements 
are multiplied according to well-known rules (see |20L [4], [5], [6]). The subspace J = V 9 is 
an ideal of the algebra V: 

x e J,y eV => xy e J. 

We now assign the «inconvenient» poles of the function R = f\/ fo to a polynomial g: 

• if oo is a nonreal pole of R of order v lying, say, in the upper half plane, then it 
will be a root of g of multiplicity v (then aJ will be a root of g); 

• if uj is a real pole of R of order v > 1, then it will be a root of g of multiplicity 

L|J > i. 

As a result, the denominator / splits (assuming that / and fi have no factors in common) 
into the product 

fo = ggh, (69) 

where the roots of h are all simple and coincide with real poles of R of odd order. 

Now assume that x, y G V 9 , x = gx 9 , y = gy 9 , and let R 9 = ggR = fi/h. Then (1651 ) 
turns into 

h 9 (x, y) = ^~ f R 9 (0 x 9 (0 ¥(0 d(. (70) 



7 



All poles of the function R 9 are real and simple; therefore, by the argument above, 

rank H 9 = deg h, sign H 9 = Ind+^(i? 9 ). (71) 

On the other hand, the polynomial g~g is nonnegative on R, hence the functions R 9 and 
R have equal signs in the neighborhood of their common real poles, and hence 

Ind^(^) = Ind^(ii). (72) 



6 codimy is the codimension of a subspace Yof a linear space X. It is equal to the dimension of a 
maximal subspace of Z C X that intersects Y trivially (according to Zorn's lemma, such a subspace Z 
always exists). If dim X < oo, then codim Y = dimX — dim Y\ 
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We are now in a position to state and prove the main theorem of this section. 

Theorem 85. The rank of the form H is equal to the number of poles of the rational 
function R (counted according to their multiplicities) , and its signature coincides with the 
Cauchy index Indl^(i2). 

Proof. The following two lemmata are necessary and sufficient for our proof. 

Lemma 86 (rank), rank if = rank if 9 + 2 degg = deg f . 

Proof. The second equality follows from (l69l) and (1711) . We now prove the first. 
By the theory of quadratic and Hermitian forms (see |20L [4], [5], [6]), 

rank if = codimA/", where N = {x G V : H(x,y) = 0, Wy G V} 

is a subspace called the kernel of H. To describe the kernel, note that 

xeAf <{=> f mariO^'K (j = 0,l,2,...), 

7 

which means that Rx is a polynomial, x is divisible by the denominator / , and x G 
Thus, W = P /o codimA/ = deg/ • □ 

Lemma 87 (signature), sign H = sign H 9 . 

Proof. Denote the number of positive (resp., negative) squares of the form H by 
pos H (resp., neg H). The number pos ii coincides with the dimension of a maximal 
subspace where the form H(x,x) is positive definite (see |20L [4], [5], [6]); the same holds 
for pos H 9 , neg H, negH 9 . Lemma [86] implies that all these quantities are finite, and 
moreover, 

pos H + neg H = (pos H 9 + deg g) + (neg H 9 + deg g) . (73) 

Let V + be a subspace of V of dimension pos H, where H\p+ is positive definite. Then 
the restricted form H 9 is positive definite on the subspace V 9 fl V + , so that 

dim(V 9 C\V + ) < pos H 9 . (74) 

On the other hand, 

dim V + < dim(V 9 n V + ) + codim V 9 . (75) 

(Indeed, if Q is a maximal subspace of V + that intersects V 9 trivially, then dim Q = 
dimP + — dim (V 9 fl V + ) and dim Q < codim V 9 .) Since codim V 9 = degg, formulae (1741) - 
(1751) imply 

pos ii < pos ii ff + deg^ and, analogously, neg H < neg H 9 + degg, 
but (1731) implies that these must be equalities. Thus, 

sign H 9 = pos H 9 — neg H 9 = pos ii — neg H = sign ii. 

□ 

This concludes the proof of Theorem [85] as well: the statement about the rank follows 
from Lemma IB"6l and the statement about the signature follows from Lemma [871 and from 
equalities (ITTjl - flTl]) . □ 



Remark 88. We now understand the influence of nonreal and multiple roots that were 
collected into g: they give the form H some «ballast» consisting of an equal number (degg) 
of positive and negative squares, which brings up the rank of H. These rather nondescript 
squares get filtered out when H is restricted to the ideal V 9 of the algebra V. 
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Remark 89. Theorem l85l deals with Ind^(i?) - not with IndpR(-R), as in §[71 However, 
if deg/i < deg/o, the two indices coincide. 

So far, it was convenient^ to consider the form if on a complex infinite- dimensional 
space. We take a more practical position now. 

First of all, the form H is real: H(x,y) = H(x,y). This implies that, for real polyno- 
mials u and v, the equality H{u + iv,u + iv) = H(u, u) + H (v, v) holds. So, without loss 
of generality, V can be assumed to be a real space (or algebra) . 

Secondly, denote for simplicity f = fo and recall that V = Vf © V s (relation (1681) ) as 
well as V s = Af (the proof of Lemma [861) . This implies that 

H(x f + x f , y f + y f ) = H f (x f , y f ) (x f , y f G Vf, x f , y f G V s ). (76) 

Lemma 90. rank H = rank Hf, sign H = sign Hf 

Proof. With V + as in the proof of Lemma 1871 formula (JTBT) yields 

xeV + , x^O =^ H(x,x) = H f (x f ,Xf) > x f ^ 0. 

So, the projector x i— > Xf to the first component of the direct su is injective on 

V + , hence does not decrease its dimension. Therefore posii/ > posif, while the reverse 
inequality is obvious. Analogously, negif/ = negH. □ 

Thus, we can consider the form H on the finite- dimensional space Vf. However, Vf, 
unlike V, is not an algebra. On the other hand, the form Hf is necessarily nondegenerate: 

x G Vf, H f (x, y) = 0, \Jy G Vf ac = 0. 

Finally, consider a basis in P/ and write the form Hf in its canonical form. 
Let the basis consist of monomials (J — 0, 1, . . . , n — 1; n = deg f,f = f Q ). Then 
(1651 ) shows that 

n—l n— 1 

*(0 = E te* =► x ) = E ^-^'0- (77) 

where 

s k = ^-jR{QC k dC (fc = 0,l,2,...). (78) 

7 

If the rational function i? is expanded into a series 

R(0 = s^C- 1 + ■ ■ ■ + ? + S + ■ • • (m = degA - deg /o) (79) 



(which converges absolutely for large values of \(\ if m < 1), then, substituting ( 1791) into 
(1751 ) and integrating term by term, we verify that the coefficients Sk {k > 0) in ( 1791 satisfy 
conditions (1781) . 

An unusual property of (TfTl is that each of its coefficients s i+ j depends only on the 
sum of its indices. Quadratic forms of this type are called Hankel forms. 

The next theorem adds to our already large collection of statements that characterize 
rational functions from the class TZ: 



7 Does the reader see why? 

8 Simply, this is division of the polynomial x with quotient / and remainder x / . 
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Theorem 91. Let 

Then R e 1Z if and only if 



so £1 

C C 2 



(deg/i < deg/o = n). 



s > 0, 



50 Si 

51 s 2 



> 0, 



so 



Sn-1 



Sn-1 



S2n-2 



> 0. 



30) 



Proof. The condition R e 7?. is equivalent to the positive definiteness of the 
form (1771) . which is by Sylvester's criterion equivalent to ( 1801) . □ 



Remark 92. It would be interesting to find out how the determinants in (fSOl) can be 
expressed directly in terms of the coefficients of the polynomials fi and fo- We will take 
up this question in the next section. 

Problem 93. Prove that V is a principal ideal domain. The latter means |20L H], [5], [6] 

that any ideal J C V (see Remark\84\) is generated by some polynomial g G V: J = V 9 . 

Problem 94. Letf,geV and let d = gcd(/, g), k — lcm(/, g). Prove that V k = V s f)V 9 
andV d = V? + V 9 . This explains the real meaning and role of the terms ({greatest common 
factor» and «least common multiple». 

Problem 95. Under the assumptions of Problem [9H suppose the following inequalities 
hold as well: 



si > 0, 



si s 2 

S2 S 3 



> o, 



Si 



> 0. 



(81) 



S n ■ • • S2n~l 

What additional properties of the function R follow? 



Problem 96. Prove the Borchardt-Jacobi Theorem: Let f be a real polynomial of degree 
n with (complex) roots A i; . . . , \ n , and leu 



X k 1 + ... + X k n (A; = 0,1, 2, 



Then the number of positive squares of the form YliJ=o s i+j^j coincides with the number 
of distinct roots of the polynomial f , and the number of its negative squares with the 
number of distinct complex conjugate pairs of roots. 

Hint: One can apply the results of this and previous Sections to the logarithmic derivative 
of the polynomial /, but a direct argument is also possible. 



Problem 97. Derive Newton's identities that connect the Newton sums Sq, S\, 
the coefficients Oq, a±, . . . , a n of the polynomial f from the previous problem. 



with 



Hint: Prove the relation 



na ( 



n—l 



+ 



+ a n -\ 



a C n + ■ 
and make use of it. 



+ a r 



£o £i £2 



The quantities being denned here are called Newton sums. They are symmetric functions of the 
roots Xj; therefore they can be found without knowing the actual roots (see [7]). 
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Problem 98. As in ProblemlM let R(X) = ip ((XI - A^u), where A : <B 
bounded linear operator on a Banach space ti G (/) G 23'. Prove that 



03 is a 



where Sk = 



0,1,2, 



Problem 99. ,4s m problem\7% let R(X) = ((A/ — A) it;/iere A : — 

bounded self-adjoint operator on a Hilbert space Sj. Prove that the Hankel forms 



S) is a 



n-l 
i,j=0 



[n 



1, 2, . . . ), where Sk 



(A k u,u) 



(A; = 0,1, 



are nonnegative definite. In which case are they positive definite ? 

Problem 100. Let A be a real n x n-matrix whose elements and minors are all positiv^. 
Let Sk (k = 0,1,2, ... ) be the (1, 1) entry of the kth power A k of the matrix A. Prove that 
inequalities (l80l and (IBB hold. 

Hint: Find formulae connecting the determinants in fl80l)- (i8Ti ) with the minors of A. 



10. Lienard-Chipart criterion 



We now attempt to combine the ideas of the preceding three sections. 

The rank and the signature of a Hankel form are not projective invariants, since they 
do not account for a possible pole of the function R = f\j f at the point oo. However, if 
deg/i ^ deg/oi then R(oo) ^ oo and Indpj 

au + (3 



then 



and 



$ G SL(2, 



($o R)(z) 



afi(z)+/3f (z) 
7h(z) + 5f (z)> 



i(R) = Ind±™(R). Moreover, if, as in §0 
(a5-(3 1 =l), 



deg(a/i + Pf ) = deg(7/i + Sf ) = max {deg f u deg /„} 

for all $ G SL(2, R), except for exactly two elements: $ when <3> ° R has a zero at the 
point z = oo, and $oo if z = oo is a pole of $oo o i?. 

Lemma 101. // 



the 



R(z) = 


feo-2™ 


+ M 


71-1 


+ 


■■■ + b n 


= S-l 


S Si 

+ — + ^ 


c 2;" 


+ Ci-2 


71—1 


+ 


Vc n 






c 


Cl 


c 2 




■ C k -1 


Cfe 


• • • C2fc_l 






&0 


6i 






■ 6fc-i 




• • • b 2 k-i 









Co 


Cl 






Cfc-l 


■ ■ ■ C2k-2 









6o 


bi 




• &fc-2 


bk-i 


■ ■ ■ b 2 k-2 

















• c 


Cl 


c k 

















• b 


61 


... b k 




1, 


2, 


. . . ,n 


J 


as 


in §\H we set 


c j = bj = 



+ 



(c + 0), 



(82) 



cfdet[s i+j ] k 1 



for j > n) 



(83) 



'Such matrices are called totally positive; for details, see |10| . 
Lo and behold the Hurwitz matrix! 
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Proof. First interchange the rows of the determinant in (l83l) to obtain 



2fc 



C C\ 

c 



c 2 







b h 

b h b 2 



Cfc-l 
Cfc-2 



Ck 



c V c i 
b v 7 & i 



6fc-2 



h-i 
h 



C2fc~l 
C2fc-2 



Cfc 



^2fc-2 
&2fc-l 



(84) 



This does not change the sign of V2fc. Indeed, lower the kth and (A;+l)st rowo to their 
initial positions in ( l83l) . This will require an ewen number of transpositions. The next pair 
of rows will then meet, the lowering operation will be applied to them, and so on. 

Now let us establish a connection between b, c and s. Multiplying ( 1821) by the denom- 
inator and equating coefficients, we get: 



(j = 0,1,2,...)- 



(85) 



i=0 



These formulae suggest by themselves what to do next, namely, to eliminate the entries 
in the lower left corner of the determinant (1841) . From each k + jth row (j — 1,2, ... , k) 
subtract rows k — j + 1, k — j + 2, . . . , k multiplied by s_i, s , • 
a result, we get zeros down and to the left, and the entry 



Sj-2, respectively. As 



dij 



CiSj- 



-lSf-1 



• • + CoSi + j-2 

in position (k + i, k + j). In matrix form, this will look as follows: 



1,2, 



k) 



d 2 i 



d\2 
d 2 2 



dhi d 



k2 



dik 
d 2 k 

dkk 



Si 



Si 

s 2 



Sfc-1 Sfc 



Sfc_l 
Sfc 

S 2 k-2 



c 




Cl 

Co 







Cfc-l 
Cfc-2 

C 



□ 



Remark 102. The quantities V2fc = V2fc(-R) are invariants under the action of the group 
SX(2,IR), since the structure of determinants ( 1831) implies that V2fc(-R + d) = V 2 k(R) 
(d = const) and V 2 k{—^) = V 2 fc(i?) due to Lemmata 7.1-7.3. 

Remark 103. V 2n coincides, up to a sign, with the resultant of the polynomials fi and 
fo- Hence 

V 2 „^0 <{=> gcd(/ ,/i) = l 

More about the resultant and other symmetric polynomials can be found in [7]; this topic 
is worth studying per se, but we do not need to invoke external results to justify the fact 
we just stated. Indeed, the inequality V 2n ^ together with Lemma ["1011 imply that the 
rank of the corresponding Hankel form H, which coincides with the total number of the 
poles of the function fi/fo, is at least deg/o, hence the fraction /i//o is in lowest terms. 
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They are marked by \J in (|84| . 
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Lemma 104. With the notation of Lemma \101\ 

V 2 > 0, V 4 > 0, . . . , V 2n > =^ lva R{z) < Q 

lm z 

If n is the number of poles of the rational function R, then the converse holds as well. 

Proof. If b Q = 0, apply Lemma fTOTl Theorem I9T1 and Theorem [721 If b Q ^ 0, apply 
the same results to the function R(z) — s_i. □ 

We already noticed that the determinants V2/C (k — 1, 2, . . . , n) are the even leading 
principal minors of the 2n x 2n matrix built exactly as the Hurwitz matrix T~L P from § 6. 
This is not just a superficial similarity. Let us go back to the main object of our study, 
viz., the real polynomial 

p(z) = a z n + a x z n - 1 H h a n (a > 0) (86) 

and let us rewrite it as 

P{z) = g (z 2 ) + zg^z 2 ). 

The polynomials go and g\ so defined are very much reminiscent of fo and fi , which first 
appeared in §[3] and were studied in detail in § §E1 IS If the degree n is odd n = 2m + 1, 
then 

g (w) = aiw m + a 3 w m_1 H h a 2m +i, 

gi(w) = a w m + a 2 w m ~ 1 H h a 2m ] 

if n = 2m, then 

go(w) = a w m + a 2 w m ~ 1 H h a 2m , 

gi(w) = axw™- 1 + a 3 w m - 2 H h a 2m -i- 

In either case, deg^o > deg^i, and the minors V 2 k = ^2k{gi/ go) and the minors % of 
the Hurwitz matrix TC P (see §[6]) are connected as follows^: 



2k 



\V2k (k — l,...,m), if n = 2m + l 



\aoV2k-1 {k = l,...,m), if n = 2m. 

If, as usual, R = gi/go, then 



(87) 



p( z ) = R(z 2 ) = --. (88) 

We are now ready to prove the Lienard-Chipart criterion, which was mentioned in §[6j 
Theorem 105 (Lienard-Chipart). A polynomial (1861) is stable if and only if 

a n > 0, a n -i > 0, a n _ 2 > 0, ... (89) 

Vn-l > 0, Vn-3 > 0, Vn-5 > 0, ... (90) 

PROOF. Necessity: follows from the Stodola condition and the Hurwitz criterion. 
Sufficiency: 1. Since the minors are positive, Lemma fl 041 and formula ( 1871 ) imply 

lm R(z) . . 

— — ^<0 Imz^O 
lmz 



Note that in both cases the last V2 m corresponds to the second-last Jfo-i, which, as we know from 
§[H «guards» the imaginary axis! 



40 



So, if p(z) = 0, Imz ^ 0, then ((881) and ((901) give 

lmR(z 2 ) Imz 1 

— = r~i = — r~i < 0- 

Imz 2 | z | 2 Imz 2 2 1 z \ 2 Re z 

Thus, by (1901 . the roots of the polynomial p lie in the union of the open left-hand plane 
and the real positive half-line. 

2. Since the coefficients are positive, the polynomial p cannot have any roots on the 
nonnegative side of the real axis. □ 

Remark 106. If we drop (1851) but keep (1901 ) in Theorem 11051 then some roots of p may 
cross over into the right half-plane, but in that case they must stay on the positive real 
half-line. From the stability point of view, this behavior is very interesting, since it means 
that the fixed point does not bifurcate into a limit cycle (these bifurcations are described 
in detail in [21]). 

Problem 107. For a polynomial discussed in Remark \l06i whose roots with positive real 
part cannot leave the real axis, it is natural to expect that these roots stay simple ( a simple 
root cannot leave the real axis other than by coalescing with another root). Prove that this 
is indeed so, i.e., that the positive roots of a polynomial satisfying (1901 are simple. 

Afterword 

Half a page is still left - it would be sinful to leave it blank. Let us draw conclusions. 

In stability theory, the Routh-Hurwitz problem, which was considered in these «lecture 
notes» from many points of view, certainly does not play a role commensurate with the 
attention we devoted to it; more precisely, it does not yet play a role it is destined for. 
Destined by whom? I don't know. Still, I believe that intrinsically beautiful mathematical 
constructs must be necessarily connected to the understanding and explanation of the 
real world that surrounds us. If you wish, you may label this a mathematical religion 
of sorts; I think many mathematicians, perhaps most, are such conscious or unconscious 
believers. It is impossible to imagine that the unreal world studied by mathematics has 
been created by human intellect; mathematicians do not invent theorems and theories 
but discover them. And this ideal world of mathematical constructions always turns out 
to parallel the «real» one; remarkably, its researchers are driven neither by «logic», as 
laymen think, nor by «applied» needs, nor even by acquired experience and knowledge, 
both certainly indispensable. They are driven by an irrational, strange intuition that lets 
them feel that intrinsic beauty and harmony, just as our senses can feel warmth and 
determine its source. It is true that the mathematician Hurwitz was «handed» a problem 
by the turbine engineer Stodola, but Hurwitz took up and solved that problem not to 
help Stodola build his turbines. Well, he would have not taken it up just for that. Such 
is indeed the relationship between mathematics and its various «applications»: the latter 
are sources of problems for the former, and the origin of these problems is a certain a 
priori guarantee of the harmony and beauty to be found there, and of the progress in 
mathematics that their solutions must bring about. 

Returning to the beginning of our scholia, let me venture an opinion: the sad fact that 
the «Routh-Hurwitz problem» is not much in demand does not mean that it is only of 
academic interest. This should rather mean that there is a hidden door behind which there 
may be lots of interesting stuff. Just as in the instructive story of "The golden key'Q- :-) 



"The golden key" by Alexei Tolstoi, a famous Russian adaptation of the book "The adventures of 
Pinocchio" by Carlo Collodi [translators' remark]. 
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Comments^ The books |11L I27L I18| offer a thorough treatment of the subject. The book 
|27| is most elementary and detailed. The monograph [11] remains one of the best matrix 
theory books in the world. The paper |18| contains an exhaustive review of classsical works 
on stable polynomials. The textbook |8j states the Hurwitz theorem and the «Mikhailov 
criterium». However, the relevant sections of this overall good book are in my opinion not 
quite satisfactorily, and it is better to use |27j. 

The monographs pQ, [7] are devoted to the Routh-Hurwitz problem for entire functions. 

The books [22], [22, EH EH [26] of G. Polya mentioned in the Introduction are not 
directly related to our topic of stable polynomials, but their reading is useful for every 
beginning mathematician. 

The book |12| is outdated and is written «for dummies». However, the now forgotten 
language APL, which was created by Kenneth Iverson not exactly as a practical pro- 
gramming language but rather as a notation system for mathematical algorithms, is per 
se interesting to a mathematician. 



15 Many of the author's original references, in Russian or translated into Russian, were replaced by 
the corresponding references in English [translators' remark]. 



